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ABSTRACT 


^  A  common  problem  which  arises  In  tactical  situations  Is  the 
tracking  of  a  non-cooperative  vehicle  on  the  basis  of  noisy  measure¬ 
ments  received  at  a  tracking  station.  The  standard  estimation  proce¬ 
dures  used  in  such  circumstances  assume  that  the  vehicle  can  be  mo¬ 
deled  as  a  dynamic  system  subject  to  a  white  noise  disturbance.  In 
many  situations,  such  a  model  may  be  inappropriate  since  the  vehicle 
may  be  moving  toward  a  specific  goal,  guided  by  some  form  of  intelli¬ 
gence.  The  tracker  should  be  able  to  incorporate  knowledge  of  this 
fact  into  his  algorithm  for  computing  his  optimal  estimate. 


This  report  proposes  a  simple  model  for  such  competitive  situa¬ 
tions.  The  vehicle's  equations  of  motions  are  described  by  a  system 
of  linear  differential  equations  with  a  guidance  law  which  minimizes 
a  quadratic  cost  functional  and  moves  the  vehicle  toward  a  goal 
unknown  to  the  tracker.  Both  a  .ixed  endpoint  and  free  endpoint 
formulation  appropriate  to  this  problem  are  considered.  The  tracker‘s 
optimal  estimation  algorithm  for  both  cases  is  derived  for  the  case 
where  the  tracker  has  a  discrete  measurement  system.  It  is  shown 
that  the  tracker  can  use  a  coupled  set  of  linear  difference  equations 
to  form  his  optimal  estimate  of  both  the  vehicle's  current  state  and 
goal . 


An  example  is  presented  of  the  fixed  endpoint  variety.  The 
per for-mance  of  the  proposed  algorithm  under  a  variety  of  initial 
assumptions  is  compared  to  the  standard  estimation  algorithm. 
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ABSTRACT 


A  common  problem  which  arises  In  tactical  situations  Is  the 
tracking  of  a  non-cooperative  vehicle  on  the  basis  of  noisy* measure¬ 
ments  received  at  a  tracking  station.  The  standard  estimation  proce¬ 
dures  used  In  such  circumstances  assume  that  the  vehicle  can  be  mo¬ 
deled  as  a  dynamic  system  subject  to  a  white  noise  disturbance.  In 
many  situations,  such  a  model  may  be  Inappropriate  since  the  vehicle 
may  be  moving  toward  a  specific  goal,  guided  by  some  form  of  Intelli¬ 
gence.  The  tracker  should  be  able  to  Incorporate  knowledge  of  this 
fact  Into  his  algorithm  for  computing  his  optimal  estimate. 

This  report  proposes  a  simple  model  for  such  competitive  situa¬ 
tions.  The  vehicle's  equations  of  motions  are  described  by  a  system 
of  linear  differential  equations  with  a  guidance  law  which  minimizes 
a  quadratic  cost  functional  and  moves  the  vehicle  toward  a  goal 
unknown  to  the  tracker.  Both  a  fixed  endpoint  and  free  endpoint 
formulation  appropriate  to  this  problem  are  considered.  The  tracker's 
optimal  estimation  algorithm  for  both  cases  Is  derived  for  the  case 
where  the  tracker  has  a  discrete  measurement  system.  It  Is  shown 
that  the  tracker  can  use  a  coupled  set  of  linear  difference  equations 
to  form  his  optimal  estimate  of  both  the  vehicle's  current  state  and 
goal . 

An  example  Is  presented  of  the  fixed  endpoint  variety.  The 
performance  of  the  proposed  algorithm  under  a  variety  of  initial 
assumptions  Is  compared  to  the  standard  estimation  algorithm. 


NOMENCLATURE 

Upper  case  letters  denote  matrices. 

Lower  case  letters  denote  vectors. 

M..  -  ijth  component  of  M 

*  J 

m.  -  ith  component  of  m 

MT,  mT  -  Transpose  of  M  or  m  respectively 
•  • 

M,  m  -  Time  derivative  of  M  or  m  respectively 

M”1  -  Inverse  of  M 

6(t-s)  -  Dirac  delta  function 

6k(k-n)  -  Kronecker  delta  function 

E [x]  -  Expected  value  of  x 
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SECTION  1.  INTRODUCTION 


The  problem  of  tracking  a  non-cooperative  vehicle  Is  one  which 
frequently  arises  In  tactical  situations.  The  standard  tracking  and 
estimation  procedures  often  assume  that  the  vehicle  being  tracked 
behaves  as  a  dynamic  system  with  white  noise  disturbances  and  unknown 
Initial  conditions.  Based  on  these  assumptions,  the  tracker  processes 
a  series  of  noisy  measurements  to  obtain  an  estimate  of  the  vehicle's 
current  state  (l.e.,  position,  velocity,  etc.).  In  this  formulation, 
no  account  Is  taken  of  any  potential  Intelligence  In  the  vehicle  which 
may  be  guiding  it  toward  a  specific  goal  which,  depending  on  the 
problem  at  hand,  may  be  known  or  partially  known  to  the  tracker. 

It  is  logical  to  expect  that  the  tracker  can  Improve  his  performance 
through  the  Incorporation  of  knowledge  about  the  vehicle's  goal  and 
guidance  law  (intelligence)  Into  his  tracking  procedure.  In  this  report 
one  method  of  Incorporating  such  knowledge,  and  the  resultant  tracking 
algorithm,  will  be  developed. 


Prior  to  developing  the  tracking  algorithm  outlined  above,  it  is 
Instructive  to  consider  an  example  where  the  proposed  algorithm  po¬ 
tentially  offers  a  significant  advantage  over  the  standard  tracking 
procedures.  Assume  that  an  enemy  vehicle  is  penetrating  the  airspace 
In  which  the  tracker  is  operating.  At  the  time  tracking  is  initiated, 
the  tracker  estimates  that  three  widely  separated  potential  targets  can 
be  reached  by  the  enemy  vehicle  In  thirty  minutes.  The  tracker's  problem 
Is  not  only  to  track  the  enemy  vehicle  but  to  scramble  Interceptor 
vehicle  from  one  of  several  airfields  and  vector  them  to  a  good  inter¬ 
cept  position.  It  Is  assumed  to  take  fifteen  minutes  to  scramble  the 
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Interceptors  end  for  them  to  fly  to  the  potential  target  nearest  their 
base.  If  the  Interceptors  are  forced  to  fly  to  one  of  the  target 
areas  further  away,  there  Is  a  high  probability  that  the  eneey -*111 
reach  the  target  prior  to  the  Interceptors.  Obviously,  In  order  to 

maximize  the  probability  of  Intercept  and  avoid  scrambling  unnecessary 
vehicle's,  the  tracker  should  scramble  only  interceptors  on  the  base 

nearest  the  enenjy  vehicle's  goal.  Thus,  the  tracker  needs  to  es¬ 
timate  the  enemy  vehicle's  goal  as  well  as  Its  current  state. 

A  thorough  analysis  of  tactical  situations  such  as  that  described 
above  quickly  leads  to  a  complex  formulation  In  the  realm  of  dynamic 
game  theory  for  which  general  solution  techniques  do  not  yet  exist. 

The  approach  taken  herein  1$  a  preliminary  attempt  to  study  this  pro¬ 
blem  utilizing  optimal  control  and  estimation  theory.  The  algorithm 
developed  Is  attractive  In  that  it  Is,  In  theory,  no  more  difficult 
to  Implement  than  the  standard  tracking  and  estimation  algorithms. 

It  does,  however,  require  that  the  tracker  possess  considerable  knowledge 
about  the  vehicle  to  be  tracked. 


SECTION  2.  PROBLEM  DEFINITION 


In  tactical  situations,  such  as  the  one  described  previously,  it 
Is  convenient  to  view  the  problem  from  two  perspectives;  that  of  the 
attacker,  and  that  of  the  tracker.  The  attacker's  problem  Is  to 
determine  an  optimal  guidance  law  which  will  take  him  from  some  Initial 
state  to  his  goal.;  The  tracker's  problem  Is  to  estimate  the  attacker's 
current  state  and  terminal  state  (or  goal)  based  on  a  series  of  noisy 
measurements.  Let  us  now  develop  a  rigorous  statement  of  each  of  these 
probl ems. 

2.1  ATTACKER'S  OPTIMAL  CONTROL  PROBLEM 

The  form  of  the  optimal  control  problem  developed  In  this  section 
Is  of  the  deterministic  fixed  endpoint,  fixed  final  time  variety.  It 
is  felt  that  this  represents  the  most  appropriate  formulation  with 
respect  to  the  type  of  problem  being  considered.  It  is  recognized 
that  In  certain  Instances  It  may  be  desirable  to  formulate  the  control 
problem  as  a  free  endpoint  problem.  This  formulation  Is  treated 
subsequently. 

It  Is  assumed  that  the  attacker  has  perfect  knowledge  of  his 
position.  Prior  to  reaching  some  Initial  point,  at  which  time  the 
attacker  will  begin  following  the  optimal  guidance  law,  the  attacker 
will  have  selected  his  goal.  The  guidance  law  being  followed  prior 
to  reaching  the  Initial  point  and  the  method  of  selecting  the  goal 
are  not  considered  here  and  will  depend  on  the  nature  of  the  tactical 


problem.  However,  it  is  assumed  that  the  goal  can  be  reasonably 
chosen  prior  to  reaching  the  initial  point  and  that  once  selected,  it 
is  not  changed  enroute.  '* 

•  The  attacker  has  control  over  his  vehicle  governed  by  the 
dynamic  system  described  by  the  set  of  first  order,  linear  differ¬ 
ential  equations 

x  (t)  =  F  (t)  x  (t)  +  D  (t)  u  (t) 

where 

x  (t)  is  an  nxl  state  vector 
F  (t)  is  an  nxn  system  matrix 
D  (t)  is  an  nxm  control  matrix 
u  (t)  is  an  mxl  control  vector 

The  system  is  initially  at  the  state  x(0)  =  xQ.  Prior  to  this 
time,  the  terminal  state  (or  goal)  of  the  vehicle  x(T)  =  x^.  has  been 
selected  as  has  the  time  required  for  this  transition,  T. 

The  attacker's  problem  is  to  determine  the  optimal  i,jidance  law 
(control)  that  will  move  his  vehicle  from  x0  to  xF  in  time  T.  It 
is  assumed  that  his  guidance  law  is  chosen  such  as  to  minimize  the 
quadratic  cost  functional 


J  «  J  {  xT(t)Q(t)x(t)  +  (x(t)  -  xF)T  E(t)  (x(t)  -  xp) 

n  ' 


as  suggested  by  Luneberger  [1968] .  The  nxn  matrices  Q(t)  and  E(t) 
are  assumed  to  be  positive  semi-definite  and  the  mxm  matrix  R(t)  is 
positive  definite.  '  * 

The  cost  functional  J  presumably  represents  the  optimal  strategy 
chosen  by  the  attacker  in  the  sense  that  it  represents  operational 
and  tactical  costs  which  the  strategy  requires  be  minimized.  Opera¬ 
tional  costs  consist  of  factors  such  as  vehicle  wear,  fuel  consump¬ 
tion,  operator  fatigue,  etc.  Tactical  costs  include  noise,  vehicle 
vulnerability,  etc.  The  choice  of  the  parameters  Q,  E,  and  R  are 
beyond  the  scope  of  this  report  and  will  depend  on  the  specific  tac¬ 
tical  situation  being  considered.  However,  they  are  assumed  to  be 
unique  within  the  context  of  a  given  situation. 

2.2  TRACKING  PROBLEM 

This  paper  is  primarily  concerned  with  the  problem  from  the 
tracker's  perspective.  The  tracker's  problem  is  to  develop  estimates 
of  the  attacker's  current  and  terminal  states  based  on  a  series  of 
noisy  measurements.  Initially  it  is  assumed  that  the  tracker  has 
estimates  of  the  attacker's  initial  state  xQ  and  terminal  state  xf: 

A  A 

denoted  xQ  and  xp  respectively.  Associated  with  these  estimates  are 
covariances 

MO)  ■  E I  (v*o)  (v»o)T| 

m°>  ■ E I  (vxf)  (vxf)T  I 


The  tracker  Is  assumed  to  obtain  discrete  noisy  measurements. 

At  each  sampling  Interval,  the  tracker  obtains  a  measurement  from  the 
measurement  system 

z(tk)  ■  H(tk)  x(tk)  +  v(tK) 

where 

z(tk)  Is  an  rxl  measurement  vector 
H(tk)  Is  an  rxn  measurement  matrix 

v(tk)  Is  an  rxl  vector  of  zero  mean  white  noise  with  covariance 

E[v(tk)  vT(tn)]  -  V(tk)6k<tk-  tn) 
tk  is  the  kth  sampling  time 

For  a  discussion  of  the  case  where  the  tracker  has  a  continuous 
measurement  system,  the  reader  is  referred  to  Luneberger  [1963]. 

The  tracker  utilizing  the  set  of  noisy  measurements  and  his 
knowledge  of  the  attacker's  system  and  objectives  must  formulate  his 
estimates  of  the  attacker's  current  and  terminal  state.  It  is 
assumed  that  the  tracker  knows  the  dynamic  system  equations  governing 
the  attacker,  the  form  and  parameters  associated  with  the  attacker's 
cost  functional,  and  the  terminal  time  T.  Given  this  knowledge,  the 
tracker  knows  the  attacker's  movement  (i.e.,  state  trajectory)  to 
within  the  2 n  variables  xQ  and  xp.  Thus,  one  may  view  this  as  a 
special  type  of  linear  regression  problem. 

At  this  point,  it  Is  worthwhile  to  make  some  general  observations 

I 

concerning  the  tracker's  problem.  Upon  cursory  examination.  It  may 
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appear  that  the  set  of  constraints  imposed  upon  the  tracker  will  be 
unsatisfiable  in  the  real  world.  While  this  may  be  true  in  many  cases, 
there  are  nunerous  examples  where  it  is  reasonable  to  assune  that  the 
tracker  possesses  the  required  knowledge.  Consider  once  again  the  case 
of  enemy  vehicles  penetrating  into  the  tracker’s  airspace,  with  the 
goal  of  attacking  a  specific  ground  target.  In  general,  specific  targets 
are  preassigned  to  vehicles,  or  groups  of  vehicles,  as  are  specific 

ingress/egress  corridors  and  weapon  delivery  profiles.  In  addition, 

.* 

vehicles  fly  according  to  well  defined  doctrine  and  tactics  which 
determine  flight  speeds,  altitudes,  evasive  maneuvers,  etc.  In  addition, 
terminal  time-On-target  may  often  be  specified  to  avoid  multiple  vehicles 
appearing  over  the  target  simultaneously,  to  coincide  with  jamming  of  the 
tracker's  air  defense  radars,  etc.  Thus,  the  tracker  may,  if  provided 
with  good  intelligence  data,  possess  the  required  information. 

In  the  preceding  discussion  a  situation  in  which  it  is  plausible 
for  the  tracker  to  possess  the  knowledge  required  for  implementation 
of  the  tracking  algor ithn  to  be  developed  has  been  described.  No 
further  attempts  will  be  made  to  justify  the  possession  of  sufficient 
knowledge  by  the  tracker.  However,  it  should  be  noted  that  the 
collection  of  the  required  data  will  remain  a  significant,  and  perhaps 
the  tracker, s  greatest  problem. 
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SECTION  3.  DEVELOPMENT  OF  THE  TRACKING  ALGORITHM 
3.1  SOLUTION  OF  THE  FIXED  ENDPOINT  OPTIMAL  CONTROL  PROBLEM 

In  order  to  develop  our  tracking  algorithm  an  explicit  solution 
to  the  attacker's  optima'  control  problem,  in  a  state  feedback  form, 
is  required.  In  developing  the  solution  to  the  problem  presented  in 
Section  2.1  it  is  therefore  necessary  to  choose  a  solution  method  which 
yields  a  control  law  in  the  desired  form. 

We  now  proceed  to  solve  the  optimal  control  problem. 

Gi  ven : 

x  =  Fx  +  Du 
x(0)  =  x0,  x ( T )  =  xp 

minimize: 

J  !  >5  0/T|xTQx  +  (x-Xp)T  E(x-Xp)  +  uTRujdt 

The  time  parameter  in  the  above  expressions  has  been  suppressed. 
Throughout  the  remainder  of  this  section,  this  parameter  will  be  used 
only  where  ambiguities  would  otherwise  arise. 

The  necessary  conditions  for  optimality  are  easily  derived 
utilizing  methods  from  the  calculus  of  variations,  and  are 


Fx  +  Du 

(3.1) 

FTa  +  Qx  +  E(x-Xp) 

(3.2) 

-R’!DTA 

(3.3) 
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with  x(0)  *  xQ  and  x{T)  «  Xp.  Here,  A(t)  is  an  nxl  dimensional  adjoint 
variable.  Together  these  conditions  define  the  optimal  control  law 
and  state  trajectory.  Sufficiency  conditions  will  be  developed 
subsequently. 

Using  the  concepts  developed  in  Bullock  1966  we  seek  a  control 
law  of  the  form 

x(t)  *  S(t)x(t)  +  b(t)  (3.4) 

where  S  is  an  nxn  matrix  and  b  is  an  nxl  vector  to  be  determined. 

Differentiating  (3.4)  with  respect  to  time  yields 
x  =  SX  +  SX  +  b, 

upon  substitution  of  (3.1)  and  (3.2)  this  becomes 
Fx  +  Du  3  5A  -  5  [fTX  +  Qx  +  E(x-xF)]  +  b 

Substituting  for  u  using  (3.3)  and  x  using  (3.4) 

F(S A  +  b)  -  DR_1DTX  =  SX  -  SFTA  -  SQ(SA  +  b) 

-  SE ( S X  +  b)  +  SExF  +  b  (3.5) 

Obviously  (3.5)  can  be  satisfied  by  choosing  b  and  S  to  satisfy  the 
set  of  differential  equations 

S  3  FS  +  SFT  +  S(Q  +  E)S  -  DR_1DT  (3.6) 
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and 

b  «  Fb  +  S(Q  +  E)b  «  SExp  (3.7) 

It  Is  possible  to  select  the  boundary  conditions  for  (3.6)  and 
(3.7)  such  that  the  boundary  condition  x(T)  *  Xp  is  satisfied.  From 
inspection  of  (3.4)  if 

b(T)  *  xF  (3.8) 

S(T)  *  0  (3.9) 

the  required  boundary  condition  is  satisfied.  We  note  here  that  S  is 
symmetric  since  $(T)  is,  and  equation  (2.6)  is  a  symmetric  Ricatti 
equation. 

The  desired  state  feedback  control  formulation  can  now  be  stated. 
From  (3.3)  and  (3.4)  the  control  is  given  by 

u  *  -R'1  DTS~ 1 ( x  -  b)  (3.10) 

where  S  and  b  are  found  by  integrating  (2.6)  and  (2.7)  backward  from 
t  *  T  to  t  *  0  using  the  boundary  conditions  (3.8)  and  (3.9). 

If  the  solution  described  by  equation  (2.10)  is  to  exist,  S 
must  be  a  nonsingular  bounded  matrix  on  the  interval  0.T).  Let  us 
first  consider  the  conditions  under  which  S  is  nonsingular  on  this 
interval.  These  conditions  are  presented  in  Lemma  2.1  which  is  attribu¬ 
table  to  Bullock  1966  . 

Lemma  2.1.  For  the  fixed  endpoint  problem  the  matrix  Ricatti 
equation 
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$  «  $FT  +  FS  +  5 (Q  +  E)S  -  DR'^D1 

with  S(T)  ■  0  has  a  nonsingular  solution  on  0,T)  if  and  only  if 
there  are  no  conjugate  points  in  the  interval  0,T). 

Where  a  conjugate  point  is  defined  as: 

Definition  2.1.  If  two  neighboring  extremals  Xj(t)  and  x2(t) 
cross  at  t  *  t*.  i.e.,  xj(t*)  =  x2(t*)  then  the  extremal  is  said 
to  have  a  conjugate  point  at  t  s  t*. 

The  existence,  or  absence,  of  conjugate  points  has  significant  impli¬ 
cations  as  to  the  existence  of  a  solution  of  the  given  form  as  shown 
by  Breakwell  and  Ho  1965  .  Specifically,  there  exists  a  unique 
feedback  control  law  if  and  only  if  there  are  no  conjugate  points. 

Furthermore,  the  absence  of  a  conjugate  point  is  necessary  to  the 
existence  of  an  extremal  which  is  also  a  minimizing  trajectory.  Thus, 
provided  that  no  conjugate  points  exist  and  that  S  remains  bounded 
on  0,T),  then  equation  (2.10)  represents  a  unique  feedback  control 
law.  These  conditions  on  S  are  in  fact  sufficient  to  the  existence  of 
a  solution. 

The  behavior  of  the  solution  as  t  +  T  deserves  consideration. 

In  general,  fixed  endpoint  problem  formulations  have  the  undesirable 
property  of  calling  for  large  control  gains  as  t  -*•  T,  resulting  from 
S”1  becoming  singular.  However,  since  x  and  b  both  approach  Xp  in 
the  limit,  (x-b)  0  and  therefore  control  gains  may  remain  bounded 

and  well  defined  except  at  t  ■  T.  In  any  event,  it  has  been  found 
that  In  practice  it  Is  usually  possible  to  approximate  the  optimal 
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and  that  S  remains  bounded  on  IQ,T),  then  equation  (2.10)  represents 
a  unique  feedback  control  law.  These  conditions  on  S  are  in  fact 
sufficient  to  the  existence  of  a  solution. 

The  behavior  of  the  solution  as  t  T  deserves  consideration. 

In  general,  fixed  endpoint  problem  formulations  have  the  undesirable 
property  of  calling  for  large  control  gains  as  t  -  T,  resulting  from 
S"1  becoming  singular.  However,  since  x  and  b  both  approach  Xp  in 
the  limit,  (x-b)  -*  0  and  therefore  control  gains  may  remain  bounded 
and  well  defined  except  at  t  =  T.  In  any  event,  it  has  been  found 
that  in  practice  it  is  usually  possible  to  approximate  the  optimal 
control,  and  hence  desired  state  trajectory,  within  acceptable  limits 
when  bounding  the  feedback  control  gains.  Of  course,  each  problem 
must  be  examined  individually  as  to  the  efficacy  of  this  technique. 

There  are  several  interesting  properties  exhibited  by  this  solu¬ 
tion  which  have  significance  in  terms  of  the  development  of  the 
tracking  algorithm.  First,  S  is  independent  of  the  state  trajectory 
x(t).  Thus,  equation  (2.6)  can  be  integrated  by  both  the  attacker  and 
tracker  without  regard  to  the  attacker's  initial  or  terminal  state. 
Secondly,  the  vector  b  depends  only  upon  the  attacker's  terminal 
state.  Thus,  the  attacker  can  integrate  equation  (2.7)  once  he  has 
chosen  his  destination,  with  the  tracker’s  knowledge  of  b  at  any  time 
being  purely  a  reflection  of  his  estimate  of  Xp. 
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control,  and  hence  desired  state  trajectory,  within  acceptable  limits 

when  bounding  the  feedback  control  gains.  Of  course,  each  problem 

must  be  examined  individually  as  to  the  efficacy  of  this  technique. 

« 

There  are  several  interesting  properties  exhibited  by  this 
solution  which  have  significance  in  terms  of  the  development  of  the 
tracking  algorithm.  First,  S  is  independent  of  the  state  trajectory 
x(t).  Thus,  equation  (2.6)  can  be  Integrated  by  both  the  attacker 
and  tracker  without  regard  to  the  attacker's  initial  or  terminal  state. 
Secondly,  the  vector  b  depends  only  upon  the  attacker's  terminal 
state.  Thus,  the  attacker  can  integrate  equation  (2.7)  once  he  has 
chosen  his  destination,  with  the  tracker's  knowledge  of  b  at  any  time 
being  purely  a  reflection  of  his  estimate  of  x^. 

3.2  SOLUTION  TO  THE  TRACKING  PROBLEM 

Prior  to  the  initiation  of  tracking,  the  tracker  must  prepare  to 
accept  measurements  by  first  determining  the  attacker's  state  equation 
system  incorporating  the  feedback  control  law.  To  do  this,  he  must 
Integrate  equations  (3.6)  and  (3.7)  backward  in  time,  just  as  the 
attack er  must  do.  As  previously  noted,  the  tracker  can  integrate 
(3.6)  exactly. 

S  «  FS  +  SFT  +  S(Q  +  E)S  -  DR_1DT  (3.11) 

S(T)  «  0 

Since  S  is  symmetric,  this  requires  Integrating  only  ^n  (n+1)  linear 
simultaneous  differential  equations.  Since  the  tracker  does  not 
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know  Xp  precisely,  he  must  Integrate  equation  (3.7)  for  all  possible 

Xp.  Equation  (3.7)  Is  linear,  hence,  this  can  be  done  by  Integrating 

the  single  matrix  differential  equation  -  - 

« 

B  -  FB  +  S(Q  +  E)B  +  SE  (3.12) 

B(T)  «  I 

where  B(t)  is  an  nxn  square  matrix.  If  Xp  were  known  to  the  tracker 
then  b  could  be  obtained  from 

b  *  Bxp. 


The  tracker  can  now  describe  the  motion  of  the  attacker  by  the 
coupled  set  of  homogenous  differential  equations 

(3.13) 


x  *  Fx  -  DR'Vs^fx-BXpJ 


Xp  «  0 

where  the  explicit  expressions  for  the  control  from  equations  (3.10) 
and  (3.12)  have  been  substituted  into  equation  (3.1).  This  can  now 
be  written  in  matrix  form  as 


(3.14) 


The  original  optimal  control  problem  has  now  been  transformed  Into 
an  initial  value  problem  albeit  at  the  expense  of  doubling  the  size 
of  the  state  vector. 


X 

F  -  0R”1DTS”1  DR"loVlB 

*  ■ 

X 

m 

X 

»  m 

i 

o 

o 

XF 

-  J 
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Tht  tracker  cannot  us*  (3.14)  as  It  stands.  The  tracker  Is 
assumed  to  have  a  discrete  measurement  system  and  thus  It  Is  necessary 
to  convert  (3.14)  Into  a  discretized  form.  This  can  be  accomplished 
by  several  means,  the  selection  of  which  will  depend  upon  thfe  complexity 
of  the  system,  accuracy  desired,  etc. 


Regardless  of  the  method  used  for  the  discretization  process, 
the  development  to  follow  holds.  To  simplify  the  problem  notatlonally, 
we  Introduce  a  2n-  dimensional  state  vector 


x(k) 


x(k) 


arid  the  discrete  system  matrix  A(k) 


can  now  be  written 


The  discrete  form  of  (3.14) 


x(k+l)  »  A(k)  x(k) 

The  tracker  can  now  view  his  problem  as  one  of  estimating  the  2n- 
dimenslonal  vector  x.(k)  from  the  noisy  measurement  system 

z(k)  ■  H(k)  x(k)  ♦  v(k) 

where 


H(k)  «  H(k)  0 

and  from  his  Initial  estimate  x(0).  In  this  formulation,  the  tracker 
can  simultaneously  estimate  both  the  current  state  x(k)  and  the  attacker1 
terminal  state  or  goal.  The  problem  as  stated  Is  amenable  to  solu¬ 
tion  by  the  standard  Kalman  filtering  techniques. 
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The  required  estimation  equation  Is 


x(k+l)  »  A(k)x(k)  ♦  K(k+l)[z(k+l)  -  H(k+l)A(k)x(k)]  (3.16) 


where 


x(k)  Is  the  2nxl  estimation  vector 

K(k+1)  Is  the  2nxr  Kalman  filter  at  step  k+1 

The  Kalman  filter  and  estimation  covariances  are  defined  by 

K(k+1)  «  P(k+1  kjH7 (k+l)^H(k+l)P(k+l  k)HT(k+l)  +  V(k+l)j "2  (3.17) 

P(k+l|k)  *  A(k)P(k| k)AT(k)  (3.18) 

P(k+l| k+1)  -  -  K(k+l)H(k+l)]p(k+l|k)  (3.19) 


(3.19) 


where 


P( k+1 | K+1)  is  a  2n  x  2n  symmetric  error  covariance  matrix  equal 


to  xjk+1)  -  x(k+l)||x(k+l)  -  x(k+l)}T  J 


P(k+l|k)  Is  a  2n  x  2n  symmetric  one  stage  predicted  error  covari¬ 
ance  matrix 

V(k+1)  Is  an  r  x  r  covariance  matrix  defined  by 
Ep(k+1)  v(k+l)TJ. 

The  Initial  conditions  for  the  estimation  algorithm  are 


2.(0)  -L 


(3.20) 


and  the  associated  covariance 


P(0|0) 


P^O) 


P  (0) 


(3.21) 


While  In  general,  equations  (3.16)  through  (3.20)  cannot  be 
simplified  In  any  meaningful  way,  their  behavior  in  certain  limiting 
cases  is  readily  ascertained.  In  particular,  consider  the  case  where 

t 

Xf  is  known  exactly  by  the  tracker.  In  this  case,  P^(0)  »  0. 

Examination  of  (3.17)  to  (3.19)  reveals  that  P3  will  then  remain 
zero  for  all  time.  Hence,  once  the  tracker  knows  Xp,  it  remains 
known  for  all  time.  However,  it  should  be  noted  that  if  the  tracker 
assumes  he  knows  Xp,  when  he,  in  fact,  does  not,  and  thus  sets  Pa(0)  *  0, 
P3  still  remains  zero  for  all  time.  .  Therefore,  if  the  tracker  makes  an 
invalid  assumption  as  to  his  knowledge  of  Xp,  his  tracking  accuracy 
should  be  degraded. 


y 
I  t 


SECTION  4.  TRACKING  OF  A  VEHICLE  UTILIZING  A  FREE  ENDPOINT 

OPTIMAL  CONTROL  LAW 

*  • 

In  this  section,  a  tracking  and  terminal  state  estimation 'algo¬ 
rithm  for  the  case  where  the  attacker  Is  utilizing  a  free  endpoint 
optimal  control  formulation  will  be  developed.  The  development  of  this 
algorithm  directly  parallels  the  development  In  the  previous  sections 
for  the  fixed  endpoint  formulation,  hence,  some  details  will  be  omitted 
In  the  development  to  follow. 

The  attacker  Is  assumed  to  have  control  over  his  vehicle  described 
by  the  dynamic  system 

x(t)  -  F(t)x(t)  +  D(t)u(t)  (4.1) 

where  the  parameters  are  as  previously  described.  The  attacker's 
problem  Is  to  determine  his  optimal  control  law  which  will  move  his 
vehicle  from  Its  Initial  state,  x(0)  ■  xQ,  to  an  unspecified  terminal 
state  x(T)  In  time  T.  T  Is  assumed  to  be  known  at  time  t  «  0.  The 
optimal  control  law  Is  such  that  the  cost  functional 

J  *  x(T)  -  xp  M  x(T)  -  xp  ♦  h  0/T(xT(t)Q(t)x(t) 

+  uT(t)R( t)u(t) >dt  (4.2) 

Is  minimized.  The  nxn  matrix  Q(t)  Is  assumed  to  be  positive  semi- 
definite  and  the  mxm  matrix  R(t)  Is  positive  definite.  The  nxl  state 
vector  xf  Is  assumed  to  an  Intended  goal  which  the  attacker  wishes  to 
approach  at  time  t  •  T. 
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Utilizing  the  concepts  from  the  calculus  of  variations  leads  to 
the  set  of  necessary  conditions  for  optimality. 


x(t)  *  F( t)x(t)  +  D(t)u{t)  (4.3) 

-A  (t)  *  FT(t)A{t)  +  Q(t)x(t)  (4.4) 

u(t)  *  -R”1(t)DT(t)A(t)  (4.5) 

A(T)  =  M^x(t)  -  xp^  (4.6) 


with  the  initial  condition  x(0)  *  x.  A(t)  here  is  an  nxl  adjoint 
variable. 

It  is  necessary  from  the  tracker’s  point  of  view  to  convert  this 
problem  into  a  form  embodying  explicit  state  feedback.  Following 
Bullock  [1966]  a  solution  is  sought  of  the  form 

A(t)  *  S(t)x(t)  +  b(t)  (4.7) 

Differentiating  (4.7)  with  respect  to  time,  gives 

A'(t)  =  $(t)x(t)  +  S(t)x(t)  +  b(t) 

Substituting  from  (4.3),  (4.4),  and  (4.5)  for  x(t),  x(t)  and  u(t) 
yields 

-FTSx  «  FTb  -  Qx  *  SFx  -  SDR‘1DTSx  -  SDR'Vb  +  Sx  +  b  (4.8) 

where  the  time  parameter  has  been  suppressed.  Equation  (4.8)  can  be 
satisfied  exactly  if  S  and  b  are  chosen  such  that 
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S  =  -SF  -  FTS  -  Q  *  SDR"1DTS  (4.9) 

and 

b  =  -F  b  +  SDR_1D  Sb  '  (4.10) 

It  is  possible  to  select  the  boundary  conditions  for  (4.9)  and 
(4.10)  such  that  (4.6)  is  satisfied.  Inspection  of  (4.7)  shows  that 

S(t)  =  M  (4.11) 

b(t)  =  -Mxf  (4.12) 

are  the  required  conditions. 

From  equation  (4.5)  and  (4.7),  the  desired  feedback  control  law 
is  given  by 

u  *  -R_1DT  |s(t)x(t)  +  b(t)]  (4.13) 

Since  M  is  known  to  both  the  attacker  and  tracker,  both  can 
integrate  (4.9)  backward  in  time  with  the  boundary  condition  (4.11). 
The  tracker  is  assumed  to  have  imperfect  knowledge  of  Xp,  hence,  he 
must  integrate  (4.10)  for  all  possible  xp.  Since  (4.10)  is  linear, 
this  can  be  done  by  integrating  the  single  matrix  differential 
equation 

B  *  -FTB  +  S0R'1DtSB  (4.14) 

with  the  boundary  condition  B(T)  *  I.  If  the  attacker's  goal  were 
known  exactly  to  the  tracker,  then  he  could  determine  b(t)  from 

(4.15) 

b(t)  *  -B(t)KXp 


20 


As  for  the  fixed  endpoint  case*  the  tracker's  knowledge  of  S(t)  is 
independent  of  his  knowledge  of  the  attacker's  state  at  any  time,  and 
only  his  knowledge  of  Xp  affects  his  estimate  of  b.  •  : 

The  sufficient  conditions  under  which  a  solution  of  the  form  of 
equation  (4.13)  exists  are  given  in  Lemma  4.1  attributable  to  Bullock. 


Lemma  4.1.  For  the  free  endpoint  problem,  the  Matrix  Ricatti 
equation 

S  =  -SF  -  FTS  -  Q  +  SDR*  Vs 

with  S(T)  =  M  has  a  solution  on  10, T)  if  and  only  if  there  are  no 
conjugate  points  in  10, T). 

The  comments  concerning  conjugate  points  made  in  Section  3.2  still 
apply,  hence,  if  there  is  a  solution  to  (4.9),  then  (4.13)  in  fact 
represents  the  unique  optimal  feedback  control  law. 


The  tracker  may  now  express  the  attacker's  motion  in  terms  of  the 
initial  value  problem 


x 


Fx  -  DR'V 


(4.16) 


*F 


0 


with  the  initial  condition  that  x(0)  =  xo>  In  matrix  form  (4.16) 
becomes 


x 


fF  -  DR' Vs 


dr'Vbm 


x 

XF 


(4.17) 
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The  tracker  is  assumed  to  have  a  discrete  measurement  system  of 


the  form 

z(k)  =  H( k)x(k)  +  v{k) 

where  the  parameters  are  the  same  as  those  in  Section  2. 
the  tracker  must  convert  (4.17)  into  a  discrete  form  to 
into  his  estimation  procedures. 


The  descrete  form  of  (4.17)  can  be  written 
x(k+l)  *  A(k)x(k) 

where  x_( k )  =  [x(k)  x^ 

A(k)  is  a  2n  x  2n  discrete  system  matrix. 

Tne  tracker's  measurement  system  may  now  be  written 
_x ( k )  =  H(k)x(k)  +  v (  k ) 
where  H(k)  =  [n(k)  0] 


The  tracker's  estmation  equations  are 
x(k+l)  =  A(k)x(k)  +  K(k+1 )  j^z(k+l)  -  H(k+l)A(k)x(k)J 
The  Kalman  filter  is  given  by 

K(k+1)  *  P(k+1  i  k)HT(k+l )  |^H(  k+  l)P(k+l  |  k)HT  (k+1 )  +  V( 
and  the  error  covariance  is  given  by 


Therefore , 
incorporate  it 


(4.12) 


as 


(4.1?: 

k+ 1  )"j  *  1  (4 . 20) 
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P(k+l|k)=  A(k)P(k  !  k)AT(k)  (4.21) 

P(k+1 | k+1 )  =  [l  -  K(k+l)H(k+l)]  P(k+l|k)  (4.22) 

where  x(k),  K(k),  P(k+l|k),  P(kjk)  are  as  definded  in  Section  3. 


The  initial  conditions  for  the  estimation  algorithm  is 


x(C)  =  [  xQ  ip]1 


(4.23) 


where  xQ  is  an  estimate  of  x(0)  and  Xp  is  an  estimate  of  Xp  at  t  =  C. 


Tne  initial  condition  for  the  error  covariance  is 
Pi (0)  0 

0  P3(0)j 


P(cj0)  = 


(4 . 2d ) 


The  estimation  algorithm,  (4.19)  through  (4.24),  is  identical  to 
that  developed  in  Section  3.  The  only  difference  lies  in  the  deri¬ 
vation  of  the  matrix  A(k)  and  in  the  interpretation  of  the  estimator 
Xp.  For  the  fixed  endpoint  case,  we  note  that  Xp  is  an  exact  esti¬ 
mator  of  the  state  of  the  attacker  at  the  terminal  time.  S;  ec  i  f  i  Call,,, 
if  Xp  is  known  exactly,  then  the  attacker's  state  at  T  is  x(T)  =  x_. 

For  the  free  endpoint  case,  Xp  is  not  an  e  xact  estimator  of  the  at¬ 
tacker's  terminal  state.  Rather,  x^.  estimates  a  set  of  terminal  cc  - 
ditions  which  the  attacker  will  tend  to  approach  as  t  -  T.  Tne 
closeness  with  which  the  attacker  approaches  Xp  will,  in  general, 
depend  not  only  upon  the  parameters  in  his  control  formulation,  but 
also  the  initial  state  xQ. 
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SECTION  5.  CASE  STUDY 
5.1  PROBLEM  DEFINITION 

The  tracking  algorithms  developed  in  Sections  3  and  4'  promise 
superior  tracking  performance  compared  to  the  standard  estination 
procedures.  In  addition,  they  have  the  added  advantage  of  encon, pas¬ 
sing  terminal  state  tracking.  In  pursuing  a  case  study,  it  is  hoped 
insight  into  a  number  of  issues  surrounding  the  nature  of  this  supe¬ 
riority  can  be  gained.  While  a  single  case  study  can  never  provide 
conclusive  answers,  it  will  help  to  answer  such  questions  as  the 
relative  difficulty  of  implementing  the  proposed  algorithm,  and  the 
improvement  in  performance  obtained. 


The  optimal  control  problem  which  shall  be  considered  is  of  the 
fixed  endpoint  variety.  Given: 


Xj(t) 

0  1 

”  x  i  ( t  )” 

0 

= 

4 

0  -  0.1 

x2(t) 

1 

x(0)  «  [0.5  0. 25] 1  =  xQ>  x(20)  =  [20.0  5.0]T  =  xp 


Minimize: 


J  = 


(  T 

10  0  ' 

I  \  T 

't  O' 

XT(t) 

x(t)  +  x(t)  -  xf 

|x(t)  -  xr 

( 

o 

•— « 

o 

l  r  J 

.0  t. 

1 

+  20  u2(t) 


dt 


Using  the  techniques  of  Section  3,  this  problem  may  be  transformed 
into  an  equivalent  initial  value  problem.  First,  the  matrices  S(t)  and 
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D ( t }  are  determined.  S ( t )  is  found  by  integrating  the  matrix  Ricatti 
equation 


■o  r 

'3  O' 

'10  +  t  O' 

0  o' 

s  « 

.0  -  0.1. 

s  +  s 

.1  -  0.1. 

+  s 

0  10  +  t 

s  - 

p  .05 

(5.2) 


backward  in  time  with  the  boundary  condition 


S  ( 20 )  = 


0 

0 


0 

0. 


B(t)  is  computed  by  integrating  the  matrix  differential  equation 


'0  c 

10  +  t  0 

't  O' 

B  = 

B  +  S 

B  -  S 

.0  -0.1  . 

0  10  +  t. 

.0  t. 

backward  in  time  with  the  boundary  condition 


B  ( 20 } 


'1  O' 

.0  1. 


The  desired  initial  value  problem  is  then  given  by 


'x(t) 

7°  7  -  (°  °1  s---(t)  (»  °)  S-'ltiM 

\0  -0.1/  \0  .05 1  \0  .05 J 

T'.t:l 

*F 

(0  01  (0  01 

Li_ 

X 

f - 

o 

o 

O 

o 

/  r 

with  the  boundary  conditions  x(0)  =  x  .  The  solution  of  this  equation 
represents  the  optimal  state  trajectory  of  the  attacker. 


From  the  tracker's  point  of  view,  it  is  now  necessary  to  discret¬ 
ize  (5.4). 
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We  note  that  (5.4)  Is  time  variant  due  to  the  presence  of  S(t)  and 
B(t),  and  hence,  determining  the  state  transition  matrix  *or  (5.4)  is  non-trivial. 
Therefore,  we  shall  develop  an  approximation  to  the  transition .matrix. 


The  exact  solution  to  (5.1)  is 


where  fit  . 


This  solution  is  approximated  as 


Upon  taking  the  integral  and  replacing  t^by  k,  the  desired  form  is 


x  (k+1 )  = 


j"l  10(l  -  e'°-lM) 


0  e 


-O.lit 


x(k)  + 


10|it  -  10(1  -  e 

V  n  ( i  _“0  •  lit 


■o.lit 


10  1  -  e" 


The  discretized  form  of  (5.4)  can  be  written 


(5.5) 


where 
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ai  .  10 (l  -  e-°ALt) 

o2  - 

a3  =  loj^t  -  lo(l  -  e‘°*lAt)| 


The  tracker's  measurement  system  is 


2(k)  =  [1 


0] 


Xj(k) 

x2(k) 


+  v  ( k ) 


where  z{ k)  and  v ( k )  are  scalar  quantities  and  E [v ( k ) v ( n)]  =  V(k)d(k-n) 
=  1.  In  terms  of  the  augmented  state  vector  [x(k)  Xp]T  the  measurment 
equation  is 


z(k)  =  [l  0  0  0] 


[”x  ( k ) 


L  XF  J 


+  v(k) . 


(5.6) 


Using  the  results  of  Section  3  and  equation  (5.5)  and  (5.6),  the 
estimation  equations  are 


V1 

a‘V 

(° 

a A 

S_1(k)  (°  °3\  S-1(k)B(k) 

r  /s  .  -» 

x ( k  +  1 ) 

—  ^ 

\o 

a  J 

\° 

“J 

\°  V 

x(k) 

Xp(k+1 ) 

(° 

°) 

f1  ") 

x  F  ( k ) 

» 

v> 

0  J 

\0  1/  J 

+  K(k+1) 


z(k+l)  -  |(1  a,)  -  (0  u,)S_1(k)  (D  a3 )S~1(k)B(k)J 


n 


x(k) 


Xp(kj 


,5.7) 


where  K(k+1)  is  the  4  x  1  dimensional  vector: 


K(k+1)  = 


vM(k*ir 

Pi i(k+l  Ik) 

K: ( k+1 ) 

s 

P2»(k+l|k) 

Kj(k+1 

Pj/.k+llk) 

K* (k+1 ) 

- 

P*<k+l|k) 

-  . 

[P„  (k+lj k)  +  V(k+1)] 


-1 


(5.8) 
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P(k+l|k)  is 
of  the  form 


(5.9) 


P(k+1 1 k)  =  A ( k ) P ( k | k ) AT ( k  > 

where  A(k)  is  the  system  matrix  of  equation  (5.5).  The  expression  for 
the  covariance  P(k+l|k+l)  is  easily  expressed  in  compact  form  as 

P^tk+llk+l)  =  Pij(k+l|k)  -  Ki(k+l)Pij(k+l|k).  (5.10) 

i,  j  =  1,  2,  3,  4 

The  initial  conditions  for  the  estimations  equations  are 

i(-l)  =  (0  0)T 

Xp(-l)  =  (See  Section  5.2  ,  individual  cases  considered) 

,  fp,(-D  0 

P(-l  i -1)  = 

0  P,(-l) 

where  and  P3(-l)  is  selected  as  appropriate  to  the  indi¬ 

vidual  cases  considered.  Note  that  here  it  is  assumed  that  the  track¬ 
er's  first  measurement  occurs  at  k  =  0,  therefore,  the  tracker's 
initial  estimates  are  at  k  =  -1.  The  value  of  k  for  this  example  runs 
from  k  *  -i  to  k  -  310  .The  time  increment  At  =  0.0625. 


Prior  to  solving  the  optimal  control  and  estimation  equations 
derived  previously,  it  will  be  useful  to  formulate  this  problem  in 
terms  of  the  standard  tracking  and  estimation  procedures.  The 
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tracking  performance  of  the  algorithms  described  by  equations  (5.7) 
through  (5.10)  may  then  be  compared  to  the  nominal  performance  exhib¬ 
ited  by  the  standard  procedures. 


To  develop  the  estimation  equations,  a  discrete  description  of 
the  attacker’s  dynamic  system  with  a  white  noise  disturbance  is 
required.  This  is  given  by 


x,(k+l)' 
x2(k+l ) 


(5.11) 


where  w(k)  is  a  scalar  white  noise  disturbance  with  covariance 

E  [w(k)w(n)]  =  W(k)dk(k-n)  =  1 

a,  =  10(1  -  e‘°'lAt) 

-O.lit 

a2  *  e 

=  10Ut  -  10(1  -  e"0,1^)  ) 

W(k)  was  selected  to  reflect  the  magnitude  of  the  control  in  the 
optimal  control  problem  as  shown  in  Section  5.2. 


The  tracker's  measurement  system  is  taken  to  be 
Xj ( k)l 

z(k)  =(10)  +  v(k)  (5.12) 

x2(k) 

-  — 

where  v(k)  has  the  associated  covariance  E[v(k)v(n)]  =  V(k)6k(k-n)  =  1. 
This  is,  of  course,  identical  to  equation  (5.6).  Tne  estimation 


equations  are 


-  T1  r  .1 

x(k+l)  s  Q  a  *(k)  +  K(k+1)  [z(k+D  -  ^i(k)  >ajX2(k)J  (5.: 


where  K(k+1)  is  defined  by 

[Mk+1)l  fPn  (k+l|k) 


K(k+1)  = 


Mk+1)  Pn(k+l|k)  ,  v  , 

(Pii(k+l|k)  +  V(k+l)j  (5.14) 

Mk+1)  P2,  (k+l|k)  v  ' 


with  P^jlk+Hk)  being  the  ijtfl  component  of  P(k+l(k).  The  single 
stage  predicted  error  covariance  P(k+ljk)  is 


1  ot, 

P(k+l|k)  =  P(kjk) 

0  oij 


1  0 


al 


W(k)  [a3  (5.15) 


The  error  covariance  at  step  k+1  is  given  by 


P(k+l|k+l) 


1  -  Kj (k+1 )  0 

-Mk+1)  1 


P(k+l|k) 


(5.16) 


The  initial  conditions  for  (5.13)  through  (5.16)  are 

..  x(-l)  =  10  0]T 


p(-n-i)  =  p»t-D  = 


1  0 
0  1 


where  it  is  assumed  that  the  first  measurement  occurs  at  k  =  0  as 
before,  and  k  runs  from  k  *  -1  to  k  *  320  with  At  =  0.0625. 


5.2  RESULTS 

The  test  case  simulation  was  carried  out  on  an  IBM  Systeii/360 
Computer.  The  differential  equations  were  solved  via  a  modified 
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Hammings  method,  as  described  in  IBM 1 19 70] ,  with  a  nominal  step  size 
of  0.03125. 


The  solution  to  the  optimal  control  problem  was  first  generated. 
Equations  (5.2)  and  (5.3)  were  integrated  backward  in  time  from  t  =  20 
to  t  s  -0.03125  using  the  given  boundary  conditions.  The  optimal 
state  trajectory  was  then  found  by  integrating  equation  (5.4)  with 
x(0)  s  xQ.  The  optimal  control  law  corresponding  to  this  trajectory 
is  given  by 


u(t)  = 


o  o" 

0  -05 


S_1(t)  [x(t)  -  B(t)xFj  . 


The  State  trajectory  and  otpimal  control  law  are  portrayed  in  Figure  1. 


The  standard  (Kalman)  estimation  algorithm  was  next  employed  to 
establish  a  benchmark  against  which  the  optimal  estimation  algorithm 
developed  in  this  paper  could  be  compared.  The  measurements  z(k) 

(5.12)  for  k  =  o  to  320  were  generated  using  the  state  trajectory  from 
the  optimal  control  problem  and  noise  values  v(k)  taken  from  the  table  of 
random  normal  numbers  presented  in  Hoel ,  Port  and  Stone  [1971], 
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’3? AS* 


The  estimated  state  vector  x  ■  xj  x2  ^  and  the  associated 
covariance  matrices  and  Kalman  filter  were  generated  using  (5.13) 
through  (5.16).  The  initial  conditions  are 

x(-l)  «  [0  0]T 

P(-H-l)  «  1  °1 

0  1  J  * 

The  outputs  from  this  simulation  are  summarized  In  Table  1  and  Figure 
1.  The  error  covariance  (Pu(K))  and  measurement  (z(k))  values  for 
selected  k  values  are  shown  In  Table  1.  The  measurement  values 
(z(k) )  shown  are  the  same  as  those  used  In  the  optimal  situation  al¬ 
gorithm  simulations  presented  subsequently.  Figure  1  presents  a 
smoothed  plot  of  the  absolute  error  values  x{(k)  >  x^(k)  for  a  sam¬ 
pling  Interval  of  at»0.625. 

The  optimal  estimation  algorithm  simulations  were  carried  out 
using  Initial  conditions  covering  a  variety  of  possible  circumstances. 
In  the  first  case  simulated.  It  was  assumed  that  x^  was  known  exactly. 
The  Initial  conditions  corresponding  to  this  case  are 


x(-l)  *  0  00  20  5  T 


The  estimated  state  vector  x  ■  xx  x2  xfl  xF2  T  and  the  associated 


error  covariances  and  Kalman  filter  were  generated  using  (5.7)  through 
(5.10).  The  measurement  z(k)  (5.6)  are  identical  to  those  generated 
for  the  standard  estimation  simulation.  The  error  covariance,  for 
selected  increments  is  presented  in  Table  2  and  the  absolute  state 
estimation  errors  are  presented  in  Figure  2. 

Comparison  of  the  results  presented  in  Table  2  and  Figure  2  with 
those  in  Table  1,  indicates  the  superiority  of  the  optimal  estimation 
algorithm  as  compared  to  the  standard  estimation  procedure.  The 
error  covariance  is  one  to  two  orders  of  magnitude  smaller  after 
the  20th  iteration  and  four  orders  of  magnitude  less  by  the  150*^ 
iteration,  when  the  standard  estimation  covariance  has  reached  steady 
state,  resulting  in  a  greater  confidence  in  the  estimated  values.  This 
result  is  reflected  in  the  state  estimation  errors.  The  optimal 
estimation  algorithm  is  seen  to  have  state  estimation  errors  of  an 
order  of  magnitude  over  most  of  the  range.  It  is  apparent  from  Figure 
2  that  the  error  associated  with  becomes  large  during  the  final 
increments.  The  importance  of  this  is  debatable,  since  xf  is  known 
and  hence  the  tracker  can  detect  this  deviation.  However,  this  result 
occurs  in  all  of  the  cases  simulated  and  is  thus  worthwhile  to  investigate. 
This  result  is  caused  by  the  nature  of  the  optimal  control  law  as 
reflected  in  the  system  matrix  (5.4).  We  note  that  the  control  acts 
as  a  forcing  function  on  x^.  Near  t  *  20,  the  control  function  has  a 
large  negative  slope  (see  Figure  1)  which  causes  a  negative  predicted 

y 

estimate  of  x  given  by  *2(k+ljk)  »  A2(k)  Xj(k)  x2(k)  (k )  Xp2(k) 
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Table  2.  Error  Covariance*  for  the  Optimal  Estimation  Algorithm 
at  Selected  Increments  when  Xp  Assumed  Known 


INCREMENT  (k) 

P  *  D 

*12  *21 

P22 

0 

.5002 

.0036 

.8313 

1 

.5048 

.2790 

.8313 

2 

.2524 

.2073 

.5694 

3 

.2044 

.0347 

.46e3 

4 

.1733 

.0389 

.3834 

5 

.1517 

.0406 

.3124 

10 

.0989 

.0276 

.1060 

20 

.0543 

-.0020 

.0129 

30 

.0288 

-.0094 

.0055 

40 

.0135 

-.0072 

.0044 

50 

.0058 

-.0040 

.0029 

60 

.0021 

-.0018 

.0016 

70 

.68E-03 

- .66E-03 

.67E-03 

80 

.1 8E-03 

- .20E-03 

.25E-03 

90 

.37E-04 

-.51E-04 

.76E-04 

100 

.62E-05 

-.91E-05 

. 18E-04 

110 

.13E-05 

- .10E-05 

.35E-05 

120 

.89E-06 

- . 19E-06 

.55E-06 

130 

•55E-06 

-.22E-06 

. 17E-06 

140 

.29E-06 

-  .17E-06 

.12E.06 

150 

.12E-06 

- .86E-07 

.71E-07 

160 

.43E-07 

- .39E-07 

.37E-07 

170 

. 13E-07 

- . 13E-07 

. 1 5E-07 

180 

. 31E-08 

- .37E-08 

•47E-08 

190 

.59E-09 

- .84E-09 

.13E-08 

200 

.90E-10 

- .14E-10 

.28E-09 

210 

.15E-10 

- . 16E-10 

•47E-10 

220 

.70E-11 

-.24E-11 

.67E-11 

230 

.44E-11 

-  .20E-11 

•18E-11 

240 

.22E-11 

— . 14E-1 1 

.11E-11 

250 

.86E-12 

-.7 OE-12 

.61E-12 

260 

.28E-12 

-.27E-12 

.27E-12 

270 

.78E-13 

- .86E-13 

.96E-13 

280 

. 18E-13 

- .22E-13 

.29E-13 

290 

.33E-14 

- .48E-14 

.72E-14 

300 

.43E-15 

- .80E-15 

.15E-14 

310 

.22E-16 

- .72E-16 

.24E-15 

315 

.14E-17 

-.88E-17 

.56E-16 

320 

•37E-19 

- .18E-17 

.84E-16 

*7nVx_  MnUV.  All  p.  .  «  n.  TyTTpT  p. p,„_  p„.  pTT 


With  the  small  Kalman  filter  values  at  this  time  (same  order  of  magni¬ 
tude  as  the  error  covariance)  this  predicted  estimate  dominates  the 
state  estimate.  This  problem  is,  of  course,  a  specific  reflection  of 
the  nature  of  the  example  being  considered  and  the  time  constant 
used.  It  cannot  be  concluded  that  such  problems  will  arise  in  general. 


A  comparison  example  to  the  one  just  described  is  the  case  where 
Xp  is  assumed  known  when  It  is  in  fact  not  known.  For  this  case,  the 
initial  conditions  are 


x(-l)  *  [0  0  10  10]T 


P(-ll-l) 


1  0  0 
0  1  0 
0  0  0 


0 

0 

0 


The  absolute’  state  estimation  errors  for  this  case  are  indicated  in 
Figure  3.  The  terminal  state  error  is  constant  with 


The  error  covariances  are  those  given  in  Table  2.  Since  invalid  assump¬ 
tions  have  been  made,  one  would  expect  the  state  estimation  errors  to 
be  large  relative  to  the  preceding  case,  as  is  the  case.  Thus,  it  is 
evident  that  a  large  penalty  is  paid  for  Incorrectly  assuming  knowledge 
about  the  terminal  state. 
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There  is  a  high  probability  that  the  tracker  would  discover  his 
mistaken  assumptions  far  in  advance  of  the  terminal  time.  This  can 
be  seen  from  the  difference  in  the  actual  measurements  and'.the  pre¬ 
dicted  measurements,  or  measurement  error 

z{k)  =  |z(k)  -  H(k)A(lc-l)x(k-l)} 

This  difference  for  the  two  cases  already  described  at  selected 
increments  are  shown  below 


INCREMENT 

(k) 

z  FOR  ASSUMED  VALUE 
xF  =  [20  5]T 

OF  TERMINAL  STATE 
xF  =  [10  10]T 

50 

.2513 

2.2523 

100 

.3386 

3.7433 

150 

.2548 

4.8220 

200 

1.6143 

6.6125 

250 

.8668 

5.5247 

300 

.7063 

18.9356 

From  this  data. 

it  is  evident  that  the 

invalid  assumptions  lead  to  a 

large  measurement  error.  One  use  for  this  result  would  be  to  provide 
a  means  of  discriminating  against  an  attacker  having  a  goal  other  than 
that  of  interest  to  the  tracker. 

It  is  far  more  likely  that  the  tracker  will  assume  he  possesses 
imperfect  knowledge  of  the  attacker's  terminal  state.  The  initial 
condition  for  the  error  covariance  matrix  under  this  assumption  is 
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’l  0  0  0 

p(-lj-l)  =  0  1  0  0 
0  0  10 

0  0  0  1 

J  i 

The  error  covariance  matrix  value  at  selected  increments  is  given  in 
Table  5.  The  estimation  simulations  were  carried  out  for  two  initial 
conditions  for  x; 

x ( —  1 )  =  (0  0  20  5]T 

and 

x ( - 1 )  =10  0  10  10]T. 

The  estimation  errors  for  these  cases  are  given  in  Tables  6  and  7 
respectively. 

Comparing  the  error  covariance  values  in  Table  5  with  those  in 
Tables  1  and  2  allows  one  to  guage  the  performance  of  the  optimal 
estimation  algorithm  under  the  assumption  that  is  unknown.  Let  us 
first  consider  the  error  variances  E  and  e[(x2-Xj) 

(x2-x:)  .  These  error  variances  in  Table  5  are  essentially  an  order 
of  magnitude  lower  than  the  values  in  Table  1  after  the  10L  iteration. 

They  are  significantly  higher  than  the  error  variances  under  the 
assumption  that  xp  is  known  (Table  2).  Thus,  the  tracking  performance  should  be 
superior  (higher  confidence)  to  the  standard  procedure,  but  not  as  good  as  for 
the  case  where  Xp  is  known.  Next,  consider  the  terminal  state  estima¬ 
tion  variances  (xpj-Xpi)J  and  E |(*f2"*F*) (xF2 1  r,'0!" 

the  values  in  Table  5  (P„  and  P..J  it  is  evident  that  the  error  variance 
associated  with  xf2  is  significantly  higher  than  that  for  any  other 
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state.  The  error  variance  associated  with  xpl  Is  approximately  an 
order  of  magnitude  higher  than  that  for  x*  and  x2.  Hence*  the  terminal 

state  estimates  will  have  lower  confidence  than  the  current. state  estl- 

*  •. 

t 

mates . 

The  estimation  errors  from  the  simulations  are  presented  In  Figures 
4  and  5  and  bear  out  the  results  expected  from  the  error  variances.  For 
the  case  where  the  Initial  estimate  Is  the  true  terminal  state  (Figure 
4),  the  current  and  terminal  state  estimates  are  of  the  same  order  of 
magnitude  as  for  the  standard  estimation  algorithm  over  most  of  the 
Increments.  In  general,  the  estimates  of  x  have  an  absolute  error 
of  the  same  order  of  magnitude  as  for  the  standard  estimation  algo¬ 
rithm,  with  those  for  x2  being  an  order  of  magnitude  less.  As  expected, 
when  the  initial  guess  as  to  the  terminal  state  Is  good,  the  estimate 
remains  relatively  good.  Figure  5  presents  the  state  estimation  errors 
for  the  case  where  the  Initial  guess  of  the  terminal  state  Is  relatively 
Inaccurate.  From  Figure  5,  we  see  that  the  current  state  estimation 
error  for  xi  4s  again  of  the  same  magnitude  as  for  the  standard  estima¬ 
tion  algorithm  of  most  of  the  range  and  better  over  the  remainder,  with 
that  for  x2  being  an  order  of  magnitude  better. 

From  Increment  100  onward.  It  can  be  seen  that  the  terminal  state 
estimation  error  Is  consistent  with  the  calculated  variances.  The  esti¬ 
mate  of  xpi  being  of  the  same  order  of  magnitude  as  the  standard  algorithm 
errors,  and  for  x2  an  order  of  magnitude  greater. 


The  relative  quality  of  the  state  estimates  required  is  naturally 
a  function  of  the  problem  at  hand.  For  some  problems,  the  improved 
estimation  performance  of  the  algorithm  developed  here  may;be  super- 
fluous.  In  such  cases,  the  added  difficulty  of  implementing  this 
algorithm  make  its  use  questionable.  For  other  problems,  the  improved 
performance  may  be  highly  desireable.  In  either  case,  the  incorpora¬ 
tion  of  terminal  state  estimation  provides  valuable  information  not 
obtainable  from  the  standard  estimation  algorithm.  This  fact  will  in 
many  cases  make  the  optimal  estimation  algorithm  desirable  even  when 
the  current  state  estimates  are  no  better  than  for  the  standard 
estimation  algorithm. 
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SECTION  6.  CONCLUSIONS 


The  optimal  estination  algorithm  developed  in  this  paper  promises 
to  provide  superior  tracking  performance  than  the  standard-  estimation 

y 

algorithm.  In  addition,  terminal  state  estimation  is  incorporated 
providing  useful  information  to  the  tracker  within  the  context  of  the 
problem  as  outlined  in  the  introduction.  There  are  several  limitations 
of  the  proposed  method.  Foremost  of  these  are  the  assumption  that  the 
attacker  moves  in  an  optimal  fashion  toward  a  specific  goal  via  a 
fixed  deterministic  criteria  known  to  both  the  attacker  and  tracker. 
However,  in  many  cases,  these  assumptions  may  be  valid. 

Despite  the  limitations  of  the  proposed  algorithm,  it  represents 
one  of  the  few  attempts  to  incorporate  the  goal  seeking  nature  of  a 
vehicle  to  be  tracked  into  the  estimation  procedure.  While  the 
method  is  based  on  a  somewhat  complicated  formulation,  the  resulting 
algorithm  is  straightforward  and  computationally  is  not  significantly 
more  complex  than  the  standard  estimation  algorithm.  The  biggest 
problem  would  seem  to  be  the  increase  in  dimensionality  required. 

The  utility  of  the  proposed  algorithm  rest  primarily  on  the 
validitiy  of  the  given  problem  formulation  as  a  model  of  the  tactical 
problem  being  considered. 
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